Eecient Online Non-parametric Density Estimation 1 Overview and Motivation

نویسندگان

  • Christophe G. Lambert
  • Scott E. Harrington
  • Nathan D. Bronson
چکیده

Non-parametric density estimation has broad applications in computational nance especially in cases where high frequency data are available. However, the technique is often intractable , given the run times necessary to evaluate a density. We present a new and eecient algorithm based on multipole techniques. Given the n kernels that estimate the density, current methods take O(n) time to directly sum the kernels to perform a single density query. The cumulative O(n 2) running time for n queries makes it very costly, if not impractical, to compute the density for large n. Our new Multipole-accelerated Online Density Estimation (MODE) algorithm is general in that it can be applied to any kernel (in arbitrary dimensions) that admits a Taylor series expansion. The running time reduces to O(log n) or even constant time, depending on the kernel chosen, and hence, the cumulative running time is reduced to O(n log n) or O(n), respectively. Our results show that the MODE algorithm provides dramatic advantages over the direct approach to density evaluation. For example, we show using a modest computing platform that on-line density updates and queries for one million points and two dimensions take 8 days to compute using the direct approach versus 40 seconds with the MODE approach. The usual approach in nance to estimating a relation between n variables is to make distribu-tional assumptions about the data generating process or to directly impose parametric restrictions on the functional relation. Recently, however, there is considerable interest in an alternative approach: non-parametric density estimation. This approach \lets the data speak for itself." Rather than imposing assumptions, the non-parametric technique allows us to directly approximate the d-dimensional density that describes how these variables interact. Non-parametric density estimation techniques have been used by A t-Sahalia and Lo 1] to estimate the state-price density for options, by Boudoukh, Richardson, Stanton, and Whitelaw 7] and Harvey 15] to design hedging 1

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

International Association of Financial Engineers First Annual Computational Finance Conference

Non-parametric density estimation has broad applications in computational nance especially in cases where high frequency data are available. However, the technique is often intractable , given the run times necessary to evaluate a density. We present a new and eecient algorithm based on multipole techniques. Given the n kernels that estimate the density, current methods take O(n) time to direct...

متن کامل

Eecient Non-parametric Estimation of Probability Density Functions

Accurate and fast estimation of probability density functions is crucial for satisfactory computational performance in many scientiic problems. When the type of density is known a priori, then the problem becomes statistical estimation of parameters from the observed values. In the non-parametric case, usual estimators make use of kernel functions. If X j ; j = 1; 2; : : : ; n is a sequence of ...

متن کامل

Multivariate online kernel density estimation with Gaussian kernels

We propose a novel approach to online estimation of probability density functions, which is based on kernel density estimation (KDE). The method maintains and updates a non-parametric model of the observed data, from which the KDE can be calculated. We propose an online bandwidth estimation approach and a compression/revitalization scheme which maintains the KDE’s complexity low. We compare the...

متن کامل

تخمین احتمال بزرگی زمین‌لغزش‌های رخ‌داده در حوزه آبخیز پیوه‌ژن (استان خراسان رضوی)

Knowing the number, area, and frequency of landslides occurred in each area has a prominent role in the long-term evolution of area dominated by landslides and can be used for analyzing of susceptibility, hazard, and risk. In this regard, the current research is trying to consider identified landslides size probability in the Pivejan Watershed, Razavi Khorasan Province. In the first step, lands...

متن کامل

Kernel Estimation in High-Energy Physics

Kernel Estimation provides an unbinned and non-parametric estimate of the probability density function from which a set of data is drawn. In the first section, after a brief discussion on parametric and non-parametric methods, the theory of Kernel Estimation is developed for univariate and multivariate settings. The second section discusses some of the applications of Kernel Estimation to high-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007